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NOVEL METHOD FOR SEQUENCE DETERMINATION USING NMR 

FIELD OF THE INVENTION 
The invention relates to methods for analyzing compositional and sequence 
5 information of polysaccharides. 

BACKGROUND OF THE INVENTION 
Heparin and heparan sulfate glycosaminoglycans are complex acidic 
polysaccharides that are involved in a variety of physiological and pathological 

10 conditions. Advances in different areas of biology have elucidated the potential roles of 
HSGAGs in key biological processes (Casu and Lindahl, 2001; Lindahl, 2000; 
Sasisekharan and Venkataraman, 2000; Shriver et al, 2002) including thrombosis 
(Petitou et al, 1999), angiogenesis (Sasisekharan et al, 1997), viral invasion (Chen et 
al, 1997; Fry et al, 1999; Shukla et al, 1999) and tumor growth (Hulett et al, 1999; 

15 Vlodavsky et al, 1999; Liu et al 2002). The repeat unit of a HSGAG polymer is a 
disaccharide comprising a uronic acid (U), which can exist in two different epimeric 
forms - a-L-iduronic (I) or p-D-glucuronic (G), linked l->4 to a a-D-glucosamine 
residue (A). There are variations within the disaccharide unit in the form of sulfation at 
the 2-0 position of the uronic acid, 3-0 and 6-0 position of the glucosamine and 

20 sulfation or acetylation of the N-position of the glucosamine (Casu and Lindahl, 2001). 

Perhaps the best studied structure-activity relationships in HSGAGs is a 
pentasaccharide sequence in heparin that specifically binds to and activates 
antithrombin-III thereby playing an inhibitory role in the blood coagulation cascade 
(Bourin and Lindahl, 1993). Heparin and its derivatives, low molecular weight heparins 

25 (LMWHs), are the most widely used clinical agents for prevention of deep vein 
thrombosis after surgery (Breddin, 2000) and for prevention of myocardial infarction 
after coronary invasion procedures (Cohen, 1999). Based on the anticoagulant properties 
of heparin, new therapeutic applications of heparin are being envisaged (Rosenberg, 
2001). A synthetic version of the pentasaccharide has been used as an antithrombotic 

30 drug (Turpie et al, 2001). 

In order to understand the structure-activity relationship of HSGAGs, several 
analytical tools have been developed for sequencing oligosaccharides which include gel 
electrophoresis (Turnbull et al, 1999), HPLC (Vives et al, 1999), matrix assisted laser 
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desorption/ionization mass spectrometry (MALDI-MS) (Venkataraman et aL 9 1999) and 
nanoelectrospray mass spectrometry (Pope et aL, 2001). These analytical tools have 
been applied to dissect the HSGAG oligosaccharide into smaller fragments using a 
battery of depolymerizing enzymes and other chemical methods and determining the 
5 sequence of the oligosaccharides based on specific properties of the smaller fragments 
(Kreuger et aL 2001). 

SUMMARY OF THE INVENTION 
The invention relates, in part, to an analytical tool for analyzing oligosaccharides, 

10 such as HSGAGs. Determining the sequence and/or composition of an oligosaccharide 
is helpful for elucidating the structure-function relationship of oligosaccharides in key 
biological processes. 

In some aspects, a method of determining the composition of an oligosaccharide 
is provided. The method involves obtaining a measurement of a first property of the 

15 oligosaccharide using NMR spectroscopy, and obtaining a measurement of a second 
property of the oligosaccharide by a second experimental method, wherein the first and 
second properties determine the composition. In one embodiment, the second property 
of the oligosaccharide is measured by capillary electrophoresis. 

A method of analyzing an oligosaccharide is provided according to other aspects. 

20 The method involves obtaining a measurement of a first type of disaccharide linkage of 
the oligosaccharide by a first experimental method, and obtaining a measurement of a 
second type of disaccharide linkage of the oligosaccharide by a second experimental 
method, to analyze the oligosaccharide. In one embodiment the first type of disaccharide 
linkage is measured by NMR spectroscopy. In another embodiment the second type of 

25 disaccharide linkage is measured by capillary electrophoresis. 

According to another aspect, a method of analyzing an oligosaccharide, by 
identifying a first property of the oligosaccharide by NMR spectroscopy, and identifying 
a second property of the oligosaccharide by capillary electrophoresis, to analyze the 
oligosaccharide is provided. 

30 The methods, in some embodiments, involve determining possible sequences of 

the oligosaccharide that are consistent with the measurement from the NMR 
spectroscopy and second experimental method. 
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In other embodiments the methods involve constructing a list of possible 
sequences based on the measurement from the NMR spectroscopy, and eliminating 
sequences from the list of possible sequences that are not consistent with the 
measurement of the second experimental method. 
5 The second experimental method may be used to distinguish the reducing and 

non-reducing ends of the oligosaccharide or fragments thereof. In one embodiment, the 
second experimental method includes chemical degradation. In another embodiment the 
second experimental method includes end-labeling. 

The second experimental method may also be used to determine the signature of 
10 the reducing end of the oligosaccharide or fragments thereof. In one embodiment the 
signature of the reducing end is determined with capillary electrophoresis. The second 
experimental method may allow the determination of the sulfation pattern of the second 
type of disaccharide linkage. 

The NMR spectroscopy, in some embodiments, includes the determination of the 
15 sulfation pattern of the oligosaccharide or fragments thereof, hi another embodiment the 
NMR is performed to identify and quantify both reducing and non-reducing ends. 

The methods may optionally involve obtaining a measure of an additional 
property of the oligosaccharide by a third experimental method to further eliminate 
sequences not consistent with measurements obtained from the third experimental 
20 method. 

In some embodiments the NMR spectroscopy is ID proton or 2D 
COSY/TOCSY. In other embodiments, the NMR spectroscopy is HSQC, DQF-COSY, 
NOESY or ROESY. The NMR spectroscopy may also be any combination of the above. 

The NMR spectroscopy may be performed on the oligosaccharide in its intact 
25 form. Alternatively, it may be performed on the oligosaccharide in a fragmented form. 
The fragmented form may be produced by enzymatic digestion. Enzymatic digestions 
may be complete or, in the alternative, partial. 

The second experimental method may also involve digesting the oligosaccharide 
to a fragmented form, e.g., optionally produced by enzymatic digestion. In one 
30 embodiment the enzymatic digestion is complete. 

In another aspect, a method of generating a list of possible sequences of an 
oligosaccharide is provided. The method involves defining a set of properties of the 
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oligosaccharide by performing NMR spectroscopy and a second experimental method, 
wherein the NMR spectroscopy provides a measurement of a first type of disaccharide 
linkage and the second experimental method provides a measurement of a second type of 
disaccharide linkage, and constructing a list of possible sequences based on the set of 
5 properties of the oligosaccharide. In one embodiment the NMR spectroscopy includes a 
measure of the monosaccharide composition of the oligosaccharide. 

The method may involve a data structure which represents the properties as non- 
character values. In one embodiment the data structure includes a value for each type of 
monosaccharide. In another embodiment the data structure encodes a value for each type 
10 of disaccharide linkage. In another embodiment the values are binary. 

A list of possible sequences of an oligosaccharide produced from the methods is 
also provided. 

In the above aspects and embodiments the labels of "first" and "second" 
experimental methods are not intended to denote the order in which the experiments need 

1 5 be performed. In some embodiments, the NMR spectroscopy may be performed before 
the other experimental methods, while in other embodiments, the NMR spectroscopy 
may be performed after the other method. In still other embodiments the two or more 
experimental methods may be performed concurrently. 

Each of the embodiments of the invention can encompass various recitations 

20 made herein. It is, therefore, anticipated that each of the recitations of the invention 
involving any one element or combinations of elements can, optionally, be included in 
each aspect of the invention. 

BRIEF DESCRIPTION OF THE DRAWINGS 
25 Figure 1 is a representation of the (A) disaccharide building block of heparin and 

heparan sulfate polysaccharides (left) with positions of sulfation marked as X (Y position 
can be sulfated or acetylated). The hexadecimal code (middle) to represent the 
disaccharide repeat unit in the PEN framework was described in our previous study 
(Venkataraman et ah, 1999). The decomposition of the disaccharide hexadecimal code to 
30 a base4 code for the uronic acid and base 8 (octal) code for the glucosamine is shown on 
in (B) The base4 codes for the (i) uronic acids, (ii) octal codes for glucosamines, (iii) 
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signed hexadecimal codes for A-U linkages and (iv) AU-A linkages observed in the 
oligosaccharides used in this study. 

Figure 2 shows the (A) *H 500 MHz spectrum of synthetic pentasaccharide PI. 
(B) ! H 500 MHz spectrum of synthetic pentasaccharide P2. Characteristic proton 
5 chemical shifts of the constituent monosaccharides are marked accordingly. 

Figure 3 illustrates the *H 500 MHz spectra of H10 decasaccharide (A). The line 
broadening is due to the presence of paramagnetic impurity; addition of deuterated 
EDTA provides a better resolved spectrum (B). The anomeric region expansion with the 
signals assignment is shown in (C). 

10 

DETAILED DESCRIPTION 
Heparin and heparan sulfate glycosaminoglycans (HSGAGs) are cell surface 
polysaccharides that bind to a multitude of signaling molecules, enzymes, pathogens and 
modulate critical biological processes ranging from cell growth and development to 

15 anticoagulation and viral invasion. Heparin has been widely used as anticoagulant in a 
variety of clinical applications for several decades. The heterogeneity and complexity of 
HSGAGs pose significant challenges to their purification and characterization of 
structure-function relationship. 

Several analytical techniques including gel electrophoresis, chromatography and 

20 mass spectrometry have been successfully applied to sequence small amounts of 
HSGAG oligosaccharides. The prerequisite for the success of most of these techniques is 
predictable and controlled depolymerization of HSGAG oligosaccharide into smaller 
fragments using a combination of enzymatic and chemical degradation methods. 
Importantly, for some of the sequencing strategies, the use of multiple exo-enzymes are 

25 required to accurately determine the different modifications of the disaccharide units. 

A methodology was developed that utilizes experimental methods to characterize 
oligosaccharides. The invention, relates in some aspects to a method for analyzing an 
oligosaccharide, such as an HSGAG. These methods, in some embodiments, encompass 
determining the presence or identity of an oligosaccharide in a sample through the 

30 analysis of the oligosaccharide. In other embodiments, methods for assessing the purity 
of an oligosaccharide in a sample are provided. The term "analyzing" as used herein 
refers to the identification of one or more properties of an oligosaccharide. In some 
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instances the analysis may be extensive and provide considerable information about the 
composition of an oligosaccharide. The term analyzing can encompass sequencing an 
oligosaccharide or determining the composition of an oligosaccharide. The term 
"determine the composition" refers to the identification of enough properties of an 
5 oligosaccharide that the oligosaccharide can be distinguished from other 
oligosaccharides. When the information regarding the properties is compiled using the 
numerical PEN framework derived by the instant inventors the sequence of an 
oligosaccharide, including order of building blocks and linkage information, can be 
compiled. 

10 The analysis of the oligosaccharide may involve identification of monomeric 

properties and linkage information. The importance of identifying the types of linkages 
between monosaccharides and disaccharides of oligosaccharides can be illustrated using 
HSGAGs as an example. The repeat unit of a HSGAG polymer is a disaccharide 
comprising a uronic acid (U) ? which can exist in two different epimeric forms - a-L- 

15 iduronic (I) of p-D-glucuronic (G), linked l->4 to a a-glucosamine residue (A). There 
are variations within the disaccharide unit in the form of sulfation or acetylation of the 
N-position of the glucosamine. Characteristic proton and 13 C chemical shifts have been 
identified for commonly occurring monosaccharides and heparin and the relative 
abundance of these monosaccharides can be quantitatively determined by integrating the 

20 proton signals. In addition to characterization of the monosaccharides, the anomeric 
proton signals of the glucosamines can be resolved further to identify linkage 
information regarding the neighboring uronic acid (A-U linkage) with a defined epimeric 
and sulfation state. 

The composition of an oligosaccharide can be determined by the methods 
25 described herein using two or more experimental techniques to measure properties of the 
oligosaccharide. One of the experimental techniques is nuclear magnetic resonance 
(NMR). By utilizing NMR the number of experimental constraints required to sequence 
HSGAG oligosaccharides can be reduced. The NMR can be utilized with at least one 
other type of experimental technique. It has been demonstrated herein that the 
30 integration of NMR and capillary electrophoresis (CE) data sets with the help of the PEN 
framework can reduce the need for NOESY/ROESY experiments which may require a 
lot more sample and more sophisticated instruments for reasonable sensitivity. 
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Thus, methods are provided to analyze the oligosaccharides to determine the 
sequence of the oligosaccharide. Using a numerical PEN framework, the information 
derived from the compositional analysis, e.g., two distinct linkage data sets, can be 
compiled to construct the sequence in a rapid, systematic and unbiased fashion. The 
5 numerical nature of the PEN framework facilitates moving between monosaccharide, 
disaccharide U-A and A-U linkage information using simple mathematical operations 
thus facilitating a systematic and unbiased way of rapidly arriving at a HSGAG 
sequence. 

The examples discussed below demonstrate the use of the methodology and the 
10 computational approach described herein. Examples 1 and 2 describe pentasaccharides 
containing both I2S and G. These examples demonstrate the value of determining the 
signature of the reducing end (methylation) for application of the analytical methods 
described herein. Once all of the A-U and U-A linkages are determined, the knowledge 
of the reducing or non-reducing end enables the determination of the sequence by 
15 moving forward or backward. Decasaccharide H10 is one of the most complex 
decasaccharides that has been characterized and verified using a combination of 
analytical tools. 

Example 3 highlights the strengths of the methods described herein in 
comparison with other analytical tools that have been used for characterization of H10 in 

20 the past. Earlier sequencing approaches for this decasaccharide required numerous steps. 
Using the methods described herein we arrived at the sequence of H10 in an unbiased 
fashion by obtaining two distinct sets of A-U and U-A linkage information that were 
quantitatively determined using a minimal set of experimental data. This example 
illustrates the flexibility of the computational method to construct a list of all possible 

25 combinations of sequences satisfying the linkage and monosaccharide composition 
obtained from NMR data and elimination of sequences that did not satisfy the CE data. 

Thus, the list of possible sequences can be constructed based on the identified 
properties, such as the charge, the nature and number of units of the oligosaccharide, the 
nature and number of chemical substituents on the units, disaccharide linkage, reducing 

30 and non-reducing ends, and the stereospecificity of the oligosaccharide, which reveal 
information about the building blocks of the oligosaccharide determined from the 
measurements of an experimental method. 
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The structural properties of oligosaccharides may provide useful information 
about the function of the oligosaccharide. For instance, the properties of the 
oligosaccharide may reveal the entire sequence of units of the oligosaccharide, which is 
useful for identifying the oligosaccharide. Similarly, if the sequence of the 
5 oligosaccharide was previously unknown, the structural properties of the oligosaccharide 
are useful for comparing the oligosaccharide to known oligosaccharides having known 
functions. The properties of the oligosaccharide may also reveal that a oligosaccharide 
has a net charge or has regions which are charged. This information is useful for 
identifying compounds that the oligosaccharide may interact with or predicting which 
10 regions of a oligosaccharide may be involved in a binding interaction or have a specific 
function. 

The invention is useful for identifying properties of oligosaccharides. A 
"property" as used herein is a characteristic (e.g., structural characteristic) of the 
oligosaccharide that provides information (e.g., structural information) about the 

1 5 oligosaccharide. A compilation of several properties of a oligosaccharide may provide 
sufficient information to identify a chemical unit or even the entire oligosaccharide but 
the property of the oligosaccharide itself does not encompass the chemical basis of the 
chemical unit or oligosaccharide. Due to the complexity of the oligosaccharide, a 
property may identify a type of monomelic building block of the oligosaccharide. The 

20 units of the oligosaccharides have more variables in addition to its basic chemical 

structure. For example, the oligosaccharide may be acetylated or sulfated at several sites 
on the building block, or it may be charged or uncharged. Thus, one property of an 
oligosaccharide may be the identity of one or more basic building blocks of the 
oligosaccharides. 

25 A basic building block alone, however, may not provide information about the 

charge and the nature of substituents of the saccharide or disaccharide. For example, a 
building block of uronic acid may be iduronic or glucuronic acid. Each of these building 
blocks may have additional substituents that add complexity to the structure of the 
building block. A single property, however, may not identify such additional substitutes 

30 charges, etc., in addition to identifying a complete building block of a oligosaccharide. 
This information, however, may be assembled from several properties. Thus, a property 
of an oligosaccharide as used herein encompasses a monosaccharide or disaccharide 
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building block of an oligosaccharide. The NMR methods described herein are useful for 
identifying information about basic monosaccharide building blocks. 

The type of property that will provide structural information about the 
oligosaccharide is a property such as charge, molecular weight, nature and degree of 
5 sulfation or acetylation, or type of saccharide. Properties include but are not limited to 
charge, chirality, nature of substituents, quantity of substituents, molecular weight, 
molecular length, compositional ratios of substituents or units, type of basic building 
block of a oligosaccharide, hydrophobicity, enzymatic sensitivity, hydrophilicity, 
secondary structure and conformation (i.e., position of helices), spatial distribution of 

10 substituents, ratio of one set of modifications to another set of modifications (i.e., relative 
amounts of 2-0 sulfation to N-sulfation or ratio of iduronic acid to glucuronic acid), 
binding sites for proteins, and linkage information. Other properties will easily be 
identified by those of ordinary skill in the art. A substituent, as used herein is an atom or 
group of atoms that substitute a unit, but are not themselves the units. 

15 It has been discovered that the use of NMR can significantly reduce the number 

of experimental constraints required to derive enough property information to identify a 
complete sequence of an oligosaccharide. By providing information regarding specific 
linkages and monosaccharide compositions NMR dramatically improves sequencing 
techniques. 

20 Thus, one embodiment of the methods utilizes the strengths of NMR to measure 

A-U linkages and an orthogonal set of U-A linkage information obtained from CE" to 
construct the sequence of an oligosaccharide. NMR is a powerful tool that can be used 
to determine numerous parameters defining the sequence of an intact oligosaccharide 
including monosaccharide composition, sulfation pattern and linkage between 

25 glucosamine and uronic acid (A-U). These parameters can be readily determined 
independent of sequence length and variability of building blocks, using a single series 
of simple ID proton and 2D COSY/TOCSY experiments. Thus, by combining the 
distinct linkage information between adjacent monosaccharides obtained from NMR (A- 
U linkage information) with the U-A linkage information obtained from a single 

30 capillary electrophoresis experiment to rapidly arrive at the sequence of HSGAG 
oligosaccharides. 
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Interpretation of NMR spectra of HSGAGs, in the past, has had certain 
limitations due to overlaps in proton signals and absence of measurable coupling 
constants. Also the sensitivity of this technique is lower than those based on detection of 
chromatographic effluents (Turnbull et al 9 1999; Vives et al, 1999; Kreuger et al, 2001) 
5 and on mass spectrometry (Pope et al, 2001 ; Venkataraman et al, 1999). Thus, it was 
very difficult to characterize samples that are only available in small quantities. It has 
been discovered that despite these limitations NMR can provide powerful analytical 
information useful in sequencing. 

NMR spectroscopy is an analytical tool that allows for the determination of 

10 molecular structure. Utilizing the magnetic properties of some nuclei, the nuclear spins 
of the nuclei can be oriented randomly with an external magnetic field. Oriented nuclei 
that are subsequently irradiated at the correct frequency will absorb energy and transition 
to a higher energy state. Upon relaxation this energy is emitted and detected in various 
NMR systems. This irradiation of the nuclei occur in pulses. In basic one dimensional 

15 (ID) NMR the excitation is produced from a single pulse and emitted radiation is 

detected as free induction decay (FID), hi two dimensional (2D) NMR spectroscopy the 
nuclei is irradiated with two pulses, and acquisition of the FID occurs at many time 
points with a delay between the pulses. 

There are many types of 2D spectroscopy which include: COSY, TOSCY, 

20 NOESY and ROESY. COSY (Correlated Spectroscopy) is helpful in determining the 
energy that is arising from neighboring protons. This is helpful if there is overlap or 
second order coupling. Spin-spin coupling in COSY allows the spectrum to yield 
through bond interactions. Another 2D NMR technique capable of measuring through 
bond interactions is TOSCY (Total Correlated Spectroscopy) which identifies the 

25 protons that produce signals within a spin system. COSY and TOSCY can be combined 
for a more powerful structural analysis. 

Other 2D NMR techniques allows the measurement of through space interactions. 
These methods are referred to as NOESY (Nuclear Overhauser Effect Spectrsocopy) and 
ROESY (Rotational Nuclear Overhauser Effect Spectroscopy). NOESY identifies the 

30 signals emitted from protons that are close in space by not directly connected by 

bonding. The NOESY spectra give through space correlations as the change in intensity 
of multiplets from neighboring nuclei upon irradiation can be extensively measured. In 
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instances where NOESY signals are weak, ROES Y can be used, a similar technique that 
has cross peaks that are only negative. Other techniques may be used and would be 
apparent to those of ordinary skill in the art. 

The NMR spectroscopy useful in the methods described herein may be ID proton 
5 or 2D COSY/TOCSY in some embodiments. ID proton NMR and 2D COSY/TOCSY 
NMR spectra provide quantitative information on multiple parameters including 
monosaccharide composition, sulfation states and A-U linkage information that define 
the sequence of an oligosaccharide. Furthermore, NMR provides an accurate method for 
direct quantification of the iduronic and glucuronic acid content in a sequence. 

10 In the method of capillary gel-electrophoresis, reaction samples may be analyzed 

by small-diameter, gel-filled capillaries. The small diameter of the capillaries (50 \xm) 
allows for efficient dissipation of heat generated during electrophoresis. Thus, high field 
strengths can be used without excessive Joule heating (400 V/m), lowering the separation 
time to about 20 minutes per reaction run, therefor increasing resolution over 

15 conventional gel electrophoresis. Additionally, many capillaries may be analyzed in 
parallel, allowing amplification of generated oligosaccharide information. 

Currently, saccharide fragments are detected in capillary electrophoresis by 
monitoring at 232 nm, the wavelength at which the A 4,5 double bond, generated upon 
heparinase cleavage, absorbs. However, other detection methods are possible. First, 

20 nitrous acid cleavage of heparin fragments, followed by reduction with 3 H-sodium 

borohydride yields degraded fragments having a 3 H radioactive tag. This represents both 
a tag which may be followed by capillary electrophoresis (counting radioactivity) or 
mass spectrometry (by the increase in mass). Another method of using radioactivity 
would be to label the heparin fragment with S 35 . Similar to the types of detection 

25 possible for 3 H4abeled fragments, S 35 labeled fragments may be useful for radioactive 
detection (CE) or measurement of mass differences (MS). 

Especially in the case of S 35 , this detection will be powerful. In this case, the 
human sulfotransferases may be used to label specifically a certain residue. This will 
give additional structural information. 

30 Nitrous acid degraded fragments, unlike heparinase-derived fragments, do not 

have a UV-absorbing chromophore. For CE, two methods may be used to monitor 
fragments that lack a suitable chromophore. First is indirect detection of fragments. We 
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may detect heparin fragments with our CE methodology using a suitable background 
absorber, e.g., 1,5-napthalenedisulfonic acid. The second method for detection involves 
chelation of metal ions by saccharides. The saccharide-metal complexes may be 
detected using UV-Vis just like monitoring the unsaturated double bond. 
5 For determining the reducing or non-reducing ends a variety of experimental 

methods may be used including chemical degradation, end-labeling and capillary 
electrophoresis. Li some embodiments, the ends may be determined by measuring a 
signature of a reducing or non-reducing end. For some oligosaccharides, the signature of 
the reducing end is methylation, and this signature may also be determined by capillary 

10 electrophoresis as well as other methods known in the art. 

The reducing end of an oligosaccharide may be distinguished from the non- 
reducing end using mass tags, for instance. All of these tags involve selective chemistry 
with the anomeric OH (present at the reducing end of the oligosaccharide), thus labeling 
occurs at the reducing end of the chain. One common tag is 2-aminobenzoic acid which 

15 is fluorescent. In general tags involve chemistry of the following types: (1) reaction of 
amines with the anomeric position to form imines (i.e., 2-aminobenzoic acid), hydrazine 
reaction to form hydrazones, and reaction of semicarbazones with the anomeric OH to 
form semicarbazides. Commonly used tags (other than 2-aminobenzoic acid) include the 
following compounds: 

20 1. semicarbazide 

2. Girard's P reagent 

3. Girard's T reagent 

4. p-aminobenzoic ethyl ester 

5. biotin-x-hydrazide 
25 6. 2-aminobenz amide 

7. 2-aminopyridine 

8. anthranilic acid 

9. 5-[(4,6-dichlorotriazine-2-yl)amino]-fluorescein 

10. 8-aminonaphthalene-l,3,6-trisulfonic acid 
30 11. 2-aminoacridone 

The properties, in some aspects of the invention, may be determined for the 
oligosaccharide in its intact or a fragmented form. Fragments of oligosaccharides in 
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some embodiments of the invention can be produced by enzymatic digestion. In some 
embodiments the digestion is complete, or it may be a partial digestion. 

Oligosaccharide fragments may be degraded using enzymes such as heparin lyase 
enzymes or nitrous acid and they may also be modified using different enzymes that 
5 transfer sulfate groups to the positions mentioned earlier or remove the sulfate groups 
from those positions. The modifying enzymes are exolytic and non-processive which 
means that they just act once on the non-reducing end and will let go of the heparin chain 
without sequentially modifying the rest of the chain. For each of the modifiable 
positions in the disaccharide unit there exits a modifying enzyme. An enzyme that adds 

10 a sulfate group is called a sulfotransferase, and an enzyme that removes a sulfate group is 
called a sulfatase. The modifying enzymes include 2-0 sulfatase/ sulfotransferase, 3-0 
sulfatase/sulfotransferase, 6-0 sulfatase/sulfotransferase and N-deacetylase-N- 
sulfotransferase. The function of these enzymes is evident from their names, for 
example a 2-0 sulfotransferase transfers a sulfate group to the 2-0 position of an 

15 iduronic acid (2-0 sulfated glucuronic acid is a rare occurrence in the HSGAG chains) 
and a 2-0 sulfatase removes the sulfate group from the 2-0 position of an iduronic acid. 

HSGAG degrading enzymes include heparinase-I, heparinase- II , heparinase-III, 
D-glucuronidase and L-iduronidase. The heparinases cleave at the glycosidic linkage 
before a uronic acid. Heparinase I clips at a glycosidic linkage before a 2 -O sulfated 

20 iduronic acid. Heparinase -III cleaves at a glycosidic linkage before an xmsulfated 

glucuronic acid. Heparinase -II cleaves at both Hep-I and Hep-III cleavable sites. After 
cleavage by the heparinases the uronic acid before which the cleavage occurs loses the 
information of iduronic vs. glucuronic acid because a double bond is created between the 
C4 and C5 atoms of the uronic acid. 

25 Glucuronidase and iduronidase, as their name suggests cleave at the glycosidic 

linkage after a glucuronic acid and iduronic acid respectively. Nitrous acid clips 
randomly at glycosidic linkages after a N-sulfated hexosamine and converts the six 
membered hexosamme ring to a 5 membered anhydromannitol ring. 

As used herein, the term "oligosaccharide" is used interchangeably with the term 

30 "polysaccharide". An "oligosaccharide" is a biopolymer comprised of linked saccharide 
or sugar units. As used herein with respect to linked units of a oligosaccharide, "linked" 
or "linkage" means two entities are bound to one another by any physicochemical means. 
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Any linkage known to those of ordinary skill in the art, covalent or non-covalent, is 
embraced. Such linkages are well known to those of ordinary skill in the art. Natural 
linkages, which are those ordinarily found in nature connecting the chemical units of a 
particular oligosaccharide, are most common. Natural linkages include, for instance, 
5 amide, ester and thioester linkages. The units of an oligosaccharide analyzed by the 
methods of the invention may be linked, however, by synthetic or modified linkages. 
Oligosaccharides where the units are linked by covalent bonds will be most common but 
also include hydrogen bonded, etc. 

The oligosaccharide is made up of a plurality of chemical units. A "chemical 

1 0 unit" as used herein is a building block or monomer which may be linked directly or 
indirectly to other building blocks or monomers to form an oligosaccharide. The 
oligosaccharide preferably is a oligosaccharide of at least two different linked units. An 
oligosaccharide is a biopolymer composed of monosaccharides linked to one another. In 
many oligosaccharides the basic building block of the oligosaccharide is actually a 

1 5 disaccharide unit which may be repeating or non-repeating. Thus, a unit when used with 
respect to a oligosaccharide refers to a basic building block of an oligosaccharide and 
may include a monomeric building block (monosaccharide) or a dimeric building block 
(disaccharide). 

A "plurality of chemical units" is at least two units linked to one another. 
20 The oligosaccharides may be native or naturally-occurring oligosaccharides which occur 
in nature or non-naturally occurring oligosaccharides which do not exist in nature. The 
oligosaccharides typically include at least a portion of a naturally occurring 
oligosaccharide. The oligosaccharides may be isolated or synthesized de novo. For 
example, the oligosaccharides maybe isolated from natural sources e.g. purified, as by 
25 cleavage and gel separation or may be synthesized e.g., by chemical synthesis. 

A data structure for representing the properties of the oligosaccharide is also 
provided. In some embodiments the data structure represents the properties as non- 
character values. These values in some embodiments can be a binary value. In some 
embodiments the building blocks are the types of monosaccharides and disaccharide 
30 linkages of the oligosaccharide. 

The rapid sequencing methodology for polysaccharides using chemical and 
enzymatic tools followed by numerical analysis techniques is described in detail in 
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Venkataraman, G., et al., Science, 286, 537-542 (1999), and U.S. Patent Applications 
Serial Nos. 09/557,997 and 09/558,137, both filed on April 24, 2000 and having 
common inventorship, all of which are specifically incorporated by reference. 

5 EXAMPLES 
Materials and methods 

The synthetic pentasaccharides PI and P2 corresponding to the active sequence 
of heparin for AT-III binding were a gift from M. Petitou, Sanofi-Synthelabo, Toulouse, 
France. The decasaccharide H10, kindly provided by RJ. Linhardt (University of Iowa), 
10 was obtained by fractionation of heparinase digest of pig mucosal heparin, on an AT-III 
column as described earlier (Toida et aL, 1996) 

NMR Spectroscopy: 

The oligosaccharide samples were prepared by dissolving 2 mg of the 
15 pentasaccharide and 150 jtig of H10 in 0.5 ml of D 2 0 99.99%. Due to signal broadening 
caused by paramagnetic ions in H10, deuterated EDTA was added to the sample to 
remove these ions. (Neville et al 9 1989) The ! H-NMR spectra were recorded at 500 
MHz on a Bruker AMX 500 spectrometer at 60 °C with presaturation of the residual 
water signals and with recycle delay of 12 seconds; a 45° pulse was used. 2D DQF- 
20 COSY (Double Quantum Filtered-COSY) and TOCSY were measured in phase-sensitive 
mode using TPPI (Time Proportional Phase Incrementation), and a shifted square sine- 
bell function was applied before Fourier transformation. 32 and 512 scans for each FID 
were used for the pentasaccharide and the decasaccharide, respectively. 

25 Compositional Analysis using capillary electrophoresis ( CE): 

Compositional analysis of the oligosaccharides was completed by exhaustive 
enzymatic digest of a 30 |uM sample followed by capillary electrophoresis (CE) as 
described earlier (Venkataraman et al, 1999). Briefly, to 1 nmol of oligosaccharide was 
added 200 nM of heparinases I, II, and III in 25 mM sodium acetate, 100 mM NaCl, 5 

30 mM calcium acetate buffer, pH 7.0. The reaction was allowed to proceed at 30°C 
overnight and then analyzed by CE in reverse polarity with a running buffer of 50 mM 
Tris/phosphate/10 juM dextran sulfate, pH 2.5. 
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Incorporating CE and NMR data as constraints using PEN framework: 

The PEN is a numerical notation scheme that encodes the sulfation pattern of a 
disaccharide building block as a series of binary on/off states and epiinerization of the 
uronic acid as a + or — sign bit leading to a signed hexadecimal coding scheme 
5 (Venkataraman et ah, 1999). Although, the PEN framework was originally developed to 
encode a U-A disaccharide building block (Di) ? it was mathematically decomposed into a 
base4 code for representing the uronic acid monosaccharide (U) and a bases code for 
representing the glucosamine monosaccharide (A) (Figure 1). Note that for the AU-A 
linkages the * used in the G/I position indicates that the epimeric state of the uronic acid 

10 is undetermined. It is also important to note that the signed hexadecimal codes 
representing the A-U linkages involves rearrangement of the 3 binary digits encoding A 
and two binary digits encoding U from the original PEN framework. As a result of the 
rearrangement, the + and — sign is used to represent 6-0 sulfation (where + represents 
unsulfated and - represents sulfated) of the glucosamine instead of the epimeric state of 

15 the uronic acid since the 6-0 sulfation is in the left most position of the A-U disaccharide 
code. Therefore there is no "extra" binary digit that has been added for representing the 
A-U disaccharide and we still use the signed basei6 hexadecimal code. Further the PEN 
framework was also used to encode an A-U disaccharide unit (Di') by transposing the 3 
bits that encode for the sulfation state of the glucosamine with the 2 bits that encode the 

20 epimeric and sulfation state of the uronic acid (Table 1). The information obtained from 
NMR and CE data is shown in A. Columns 1-3 indicate the number of linkages between 
glucosamine residues (colored in gray) the uronic acids in column 4 obtained from NMR 
data. Columns 5-7 indicate the linkages between uronic acid and the glucosamines 
obtained from CE data. The sequences that satisfy the monosaccharide composition and 

25 A-U linkage information (Di') are shown in B. Application of the U-A linkages from CE 
data reduces Lnmr to the final correct sequence. 

Table 1: Sequence assignment of H10 
A 



Linkage information from CE and NMR 
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10 



B 



A-U linkage from NMR 




U- A linkage from CE 


A* 


An Ac, 6X 


AnS,6X 


AU 2S 


AnS,6S 


AnAc,6S 


A* 








3 










2 


I2S 












1 


I 




1 






1 




G 






1 



Sequences 


satisfying NMR data CLmmr) 


Di' linkage 




Sequence 


a r rang erne nt 






AU -5 


-5 


-2 


-4 


A* 


+DDC-57 


AU -5 


-5 


-5 


-2 


A* 


+DDD4-7 


AU -5 


-2 


-5 


-4 


A* 


+DC-5D7 


AU -5 


-4 


D 


-2 


A* 


+DD5C-7 


AU -5 


-2 


-4 


-5 


A* 


+DC-5DF 


AU -5 


»4 


-2 


-5 


A* 


+DD4-5F 


AU -2 


-5 


-5 


-4 


A* 


+C-5DD7 


AU -4 


-5 


-5 


-2 


A* 


±D5DC-7 


AU -2 


-5 


-4 


-5 


A* 


+C-5D5F 


AU -4 


-5 


-2 


-5 


A* 


+D5C-5F 


AU -2 


-4 


-5 


-5 


A* 


±C-55DF 



Apply CE data 
as constraints 



Sequences satisfying 
CE and NMR Data 

£Lnmr,ce) 



1-D proton NMR spectrum along with the 2D COSY, HSQC (Heteronuclear 
15 Single Quantum Coherence) and TOCSY spectra provide data on the chemical shifts and 
coupling constants of most the ring protons of the constituent monosaccharides. This 
data was used to uniquely identify the monosaccharides (Ui and Ai) and obtain the 
number of monosaccharides for a given length of the sequence. In addition to the identity 
of the glucosamine monosaccharides, their characteristic anomeric chemical shifts were 
20 further resolved to identify their linkages to adjacent uronic acids (Ai-Ui linkages 
defining Di 5 disaccharides). The Ui, Ai and Di 5 information was used to build a list of all 
the possible sequences satisfying this data (L NM r). The sequences in L N mr represent a 
comprehensive sample space without any bias towards commonly occurring sequences. 

Disaccharide compositional analysis using CE provides accurate information on 
25 the sulfation pattern of a AU-A disaccharide, thus identifying all the U-A linkages (±Di) 
whose sign bit is not known due to the A 4 " 5 unsaturated bond. Incorporating the 
disaccharide linkages ±Di obtained from CE data eliminates most of the sequences from 
Lnmr converging on a single sequence. 
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Results 

Example 1: Pentasaccharide PI 

Several characteristic chemical shifts of the monosaccharide anomeric protons 
were observed in the ID proton NMR spectrum of the pentasaccharide (Figure 2A). The 
5 ID proton signals along with the 2D COSY and TOCSY spectra were used to assign the 
monosaccharides. The signal patterns at 5.648, 5.438 and 5.041 ppm were assigned to 
the anomeric protons of N-sulfated glucosamines (Ans,6x). Further, the 6-0 sulfation of 
all these glucosamines were confirmed by TOCSY. The signal at 3.43/57.5 ppm 
indicates the presence of an O-methyl group linked to the reducing terminal. In addition, 
10 the presence of the methyl group at the reducing end also accounts for the absence of the 
typical reducing end carbon chemical shift (92-93 ppm). The chemical shifts at 5.251 and 
4.635 are in agreement with an hs and G respectively. The anomeric proton signals at 
5.648 and 5.438 are distinguished further as arising from A N s,6s-l2s and A NS ,6S-G 
respectively. 

15 Integration of these peaks (Guerrini et al., 2001) gave the relative molar 

abundance of the glucosamines as A ns ,6s-I2s:A N s,6s-G:Ans,6s = 1:1:1 and that of the 
uronic acids as l2s:G =1:1. Thus from the ID and 2D NMR data the identity A* = [5 8 ; 5 8 ; 
58 (Ans,6s)] ? Uj = [I4 (hs)l 24 (G)] and relative abundance (m5=3 ? nl=n2=l) of the 
monosaccharides constituting the sequence were determined. Further the linkages Di' = 

20 [~5i6 (A N s,6S-l2s); -61 6 (A N s,6S-G)] were also obtained form NMR data. Based on this 
information from the NMR data there can be two possible (L N mr) pentasaccharide 
sequences: 5 8 1 4 5 8 2 4 5 8 (A N s,6S - I2S - A NS ,6s - G - A N s,6S,OMe) and 5 8 2 4 5 8 I4 5 8 (A N s,6S 

— G — Ans,6S - I2S — AisrS,6S,OMe). 

Capillary electrophoresis of the fragments formed by complete digestion of the 
25 pentasaccharide with the heparinases resulted in two peaks corresponding to a trisulfated 
AU2sAns,6s and a disulfated disaccharide AUA N s 9 6s thus defining ±Di = [±D] 6 ; ±5i 6 ]. The 
relative molar abundance of these two disaccharides was calculated as 1:1 by integration 
of the CE signals and normalizing the peak areas using an internal calibration. The 
migration time of the ±Di<5 disaccharide was slightly different from the standard 
30 indicating that the methylated glucosamine is a part of the trisulfated ±Di6 disaccharide. 
Thus the data from CE fixes the sulfation state of methylated reducing end disaccharide. 
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Incorporating the constraints from CE data eliminated one of the sequences from L NM r 
thus converging on 5 S 2 4 5 8 1 4 5 8 (A NS , 6 s - G - A NS , 6 s - hs - A NS , 6 s,OMe). 

Example 2: Pentasaccharide (P2). 
5 From the ID proton spectrum (Figure 2B) the signal pattern (anomeric peaks at 

5.64/99.6 ppm, 5.50/98.1 ppm) is consistent with N-sulfated, 6-O-sulfated glucosamines 
(A N s,6s) ? and A* bearing an extra 3-0 sulfate group. Similar to Example 1, the signal 
pattern at 3.43/57.5 ppm corresponds to a glucosamine with a methylated reducing end. 
Also the anomeric chemical shift at 5.64 ppm arises from a A N s,es linked to G as shown 

10 in Example 1. Signals at 5.20/101.7 ppm and 4.78/72.4 ppm agree with HI and H5 of I 2S 
residue, and the anomeric signal at 4.6/103.3 ppm with G (Mulloy and Johnson, 1987; 
Yates et al, 1996). Thus Ui = [1 4 (I 2 s); 2 4 (G)] and Ai = [5 8 ; 5 8 (A NS ,6s); 7 8 (A*)]. In this 
case we have only one of the two elements of Di' defined = [-6i6 (A NS ,6s-G)]. 
Incorporating the inferences from NMR data as constraints we get Lnmr = 2 sequences: 

15 5 8 2 4 7 8 1 4 5 8 (A N s,6s-G-A*-I 2 s-A N s,6S,oMe) and 7 8 1 4 5 8 2 4 5 8 (A*-l2s-A NS ,6s-G-A NS ,6s,OMe) 
The disaccharide composition analysis using CE resulted in a single peak 
corresponding to a trisulfated disaccharide with a shifted migration time indicating the 
presence of the methylated glucosamine (±Di = [±Die]). Using the data from CE one of 
the sequences from L NM r was eliminated to give the right sequence 5 8 2 4 7 8 1 4 5 8 (A NS ,6S- 

20 G-A*-I 2 s-A N s,6s,oMe). This sequence is consistent with the notion that the A N s,6S-G 
linkage is resistant to cleavage by heparinase I, II and III due to the presence of the 3-0 
sulfated A* (Shriver, Z et ah 2000) thus resulting only in a single disaccharide observed 
using CE. 

25 Example 3; Decasaccharide (H10): 

While the first two synthetic pentasaccharide examples clearly outline the 
methodology of our approach, the method is better illustrated by the longer and more 
complex heparin derived oligosaccharide (H10). This sequence is presently among the 
most complex heparin derived oligosaccharide sequenced to date. It is important to point 

30 out that much effort has gone into the isolation and sequencing of H10 (Toida et aL, 
1996). Due to its complexity, there were inaccuracies in its structure determination in the 
past and only recently using a combination of analytical tools, this sequence was 
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established (Venkataraman et aL, 1999; Shriver et al, 2000). NMR spectroscopy has 
been used in the past to corroborate its sequence (Shriver et al., 2000) but only the 
monosaccharide composition was established and there was bias in the interpretation of 
the NMR data based on the determined sequence. Using these examples we highlight the 
5 flexibility of our approach in providing an unbiased assignment of complex heparin 
derived oligosaccharide structures. 

The signal line broadening of the proton spectrum of H10 (Figure 3 A) is caused 
by the complexation of paramagnetic ions with the negatively charged groups. Addition 
of EDTA provides a better resolved spectrum (Figure 3B) by removal of these 
10 paramagnetic ions. (Neville, et al., 1989) 

The assignment of the anomeric signals (Figure 3C) and their respective proton 
patterns (Table 2), were carried out by COSY and TOCSY experiments. 

Table 2: *H chemical shifts of the constituent monosaccharides of the H10 sample. 

15 Chemical shifts are given in ppm downfield from trimethylsilyl propionate (TSP) as 



standard. 





AU 


AnS,6S 3 


I2S 3 


A NS 


I 


AnAc 


G 


A* 


HI 


5.521 


5.42 


5.20 


5.369 


5.019 


5.386 


4.668 


5.460 


H2 


4.635 


3.31 


4.34 


3.275 


3.767 


3.940 


3.415 


3.483 


H3 


4.314 


3.65 


4.22 


3.65 


4.126 


3.77 


3.71 


4.571 


H4 


6.001 


3. 11 (3. 82 If 


4.13 


3.77 


4.102 


3.75 


3.804 


4.040 


H5 




4.01 


4.87 


4.01 


4.829 


4.01 


3.71 


4.261 


H6 




4.40 




4.40 




4.35 




nd 



- two monosaccharide residues 



b - H4 chemical shift of the Ans,6s residue following the non reducing AU unit. 

20 The signals detected between 4.2-4.4 ppm are in agreement with H-6 proton from 

6-0 sulfated glucosamine. However since this chemical shift lies in the crowded area of 
the spectrum and due to the presence of minor impurities in the sample it was not 
possible to accurately determine the molar abundance of glucosamines containing the 6- 
O sulfate groups. However disaccharide compositional analysis of H10 using CE 

25 indicated the presence of 3 major disaccharide components - AU2s-A NS , 6S ? AU-Anac, 6S> 
AU-A* in the ratio 3:1:1, respectively, giving ±Di = [±D; ±4; ±7]. Thus the data from CE 
fixed the 6-0 sulfation of all the glucosamines. 
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The relative abundance of the glucosamine monosaccharides calculated by signal 
integration were 5 8 (A N s,6s): 7 8 (A*): 4 8 (A NAc ,6s) = 3:1:1, thus Ai = [5 8 ; 5 8 ; 5 8 ; 7 8 ; 4 8 ], 
The two a anomeric signals at 5.20 and 5.019 ppm arise from 2-0 sulfated and non 
sulfated iduronic acid, respectively, as demonstrated by the chemical shift pattern. The 
5 only P proton signal of the spectrum (at 4.669 ppm) belongs to a glucuronic acid residue. 
Protons at 6 ppm and 5.521 ppm belong to the H4 and HI of the AU residue. The H2 at 
4.635 ppm indicates that the unsaturated uronic acid residue is 2-O-sulfated. The relative 
abundance of the uronic acid monosaccharides was calculated as I 2 s: I: G: AU2S were 
identified in the ratio 2:1:1:1 respectively thereby defining Ui = [U; U; 0 4 ; 2 4 ; (*1>4] 

10 (where * stands for AU since this bit is not defined). 

The chemical shift of the signal at 5.369 ppm agrees with a Ans,6s linked to I. The 
X H anomeric chemical shift of a A NAc , 6S is distinct for A N ac,6S-I (5.14-5.18 ppm) and 
A NAc ,6s~G (5.30-5.36 ppm) linkages (Cohen, 1999; Chuang, et al, 2001). The anomeric 
proton of A N ac,6s at 5.386 ppm confirms the presence of Anac,6x-G linkage in the 

15 sequence. The chemical shift at 5.42 agrees with both Ans,6s linked to I2S and A NS ,6s at 
the reducing end. Since the AU residue is linked to an A NS ,6S unit and a second A N s,6S is 
linked to I, two possibilities are left for the reducing end, one with A NS ,6s and the other 
with A*. However, the chemical shift pattern associated with A* (HI, 5.464ppm; H2, 
3.480ppm; H3, 4.564ppm, H4, 4.041ppm) is the same as found by Yamada et al (1993) 

20 for a heparin tetrasaccharide with this residue at the reducing end (chemical shifts in the 
Yamada et al paper are systematically shifted about -0.03 ppm with respect to our 
values). Thus the signal pattern of A* is consistent with its location at the reducing end. 
Based on the relative abundance of these signals all the elements of Di' were defined as 
[-5i6; -5ie; -4i 6 ; -2i 6 ]. Translating the Ai, Ui and Di' to constraints using the PEN 

25 framework Lmmr = 12 sequences were obtained (Table 1). 

Eliminating the sequences from L NM r that do not contain disaccharide linkages 
corresponding to ±Di resultedin a single sequence DDD4-7 which is consistent with the. 
H10 sequence obtained earlier (Shriver, et al, 2001). 

30 The foregoing written specification is considered to be sufficient to enable one 

skilled in the art to practice the invention. The present invention is not to be limited in 
scope by examples provided, since the examples are intended as a single illustration of 
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one aspect of the invention and other functionally equivalent embodiments are within the 
scope of the invention. Various modifications of the invention in addition to those 
shown and described herein will become apparent to those skilled in the art from the 
foregoing description and fall within the scope of the appended claims. The advantages 
5 and objects of the invention are not necessarily encompassed by each embodiment of the 
invention. 

All references, patents and patent publications that are recited in this application 
are herein incorporated by reference. 
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We claim: 

CLAIMS 

1. A method of determining the composition of an oligosaccharide, comprising: 
obtaining a measurement of a first property of the oligosaccharide using NMR 

5 spectroscopy, and obtaining a measurement of a second property of the oligosaccharide 
by a second experimental method, wherein the first and second properties determine the 
composition. 

2. The method of claim 1, wherein the second property of the oligosaccharide is 
1 0 measured by capillary electrophoresis . 

3. A method of analyzing an oligosaccharide, comprising: obtaining a 
measurement of a first type of disaccharide linkage of the oligosaccharide by a first 
experimental method, and obtaining a measurement of a second type of disaccharide 

15 linkage of the oligosaccharide by a second experimental method, to analyze the 
oligosaccharide. 

4. The method of claim 3, wherein the first type of disaccharide linkage is 
measured by NMR spectroscopy. 

20 

5. The method of claim 3, wherein the second type of disaccharide linkage is 
measured by capillary electrophoresis. 

6. A method of analyzing an oligosaccharide, comprising: identifying a first 
25 property of the oligosaccharide by NMR spectroscopy, and identifying a second property 

of the oligosaccharide by capillary electrophoresis, to analyze the oligosaccharide. 

7. The method of claim 1 or 4, further comprising: determining possible 
sequences of the oligosaccharide that are consistent with the measurement from the 

30 NMR spectroscopy and second experimental method. 
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8. The method of claim 1 or 4, further comprising: constructing a list of possible 
sequences based on the measurement from the NMR spectroscopy, and eliminating 
sequences from the list of possible sequences that are not consistent with the 
measurement of the second experimental method. 

5 

9. The method of claim 1, 3, 4, 7 or 8, wherein the second experimental method 
distinguishes the reducing and non-reducing ends of the oligosaccharide or fragments 
thereof. 

10 10. The method of claim 9, wherein the second experimental method includes 

chemical degradation. 

1 1 . The method of claim 9, wherein the second experimental method includes 
end-labeling. 

15 

12. The method of claim 1, 3, 4, 7 or 8, wherein the second experimental method 
determines the signature of the reducing end of the oligosaccharide or fragments thereof. 

13. The method of claim 12, wherein the signature of the reducing end is 
20 determined with capillary electrophoresis. 

14. The method of claim 1, 2, 4, 6, 7 or 8 ? wherein the NMR spectroscopy 
includes the determination of the sulfation pattern of the oligosaccharide or fragments 
thereof. 

25 

15. The method of claim 1, 3, 4, 7 or 8, wherein the second experimental method 
allows the determination of the sulfation pattern of the second type of disaccharide 
linkage. 



30 16. The method of any one of claims 1-8, further comprising: obtaining a 

measure of an additional property of the oligosaccharide by a third experimental method 



WO 2004/055491 



PCT/US2003/015850 



-28- 

to further eliminate sequences not consistent with measurements obtained from the third 
experimental method. 

17. The method of claim 1, 2, 4, 6, 7 or 8, wherein the NMR spectroscopy is ID 
5 proton or 2D COS Y/TOCSY. 

18. The method of claim 1, 2, 4, 6, 7 or 8, wherein the NMR spectroscopy is 
performed on the oligosaccharide in its intact form. 

10 19. The method of claim 1, 3, 4, 7 or 8, wherein the second experimental method 

includes digesting the oligosaccharide to a fragmented form. 

20. The method of claim 19, wherein the fragmented form is produced by 
enzymatic digestion. 

15 

21. The method of claim 20, wherein the enzymatic digestion is complete. 

22. The method of claim 1, wherein the NMR is performed to identify and 
quantify both reducing and non-reducing ends. 

20 

23. A method of generating a list of possible sequences of an oligosaccharide, 
comprising: defining a set of properties of the oligosaccharide by performing NMR 
spectroscopy and a second experimental method, wherein the NMR spectroscopy 
provides a measurement of a first type of disaccharide linkage and the second 

25 experimental method provides a measurement of a second type of disaccharide linkage, 
and constructing a list of possible sequences based on the set of properties of the 
oligosaccharide. 

24. The method of claim 23, wherein the NMR spectroscopy includes a measure 
30 of the monosaccharide composition of the oligosaccharide. 
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25. The method of claim 23, further comprising a data structure which represents 
the properties as non-character values. 

26. The method of claim 25, wherein the data structure includes a value for each 
5 type of monosaccharide. 

27. The method of claim 25, wherein the data structure encodes a value for each 
type of disaccharide linkage. 



10 



28. The values of claim 25 ? wherein the values are binary. 
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Representation of monsaccharides and linkages using PEN 
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